COMPLEXITY OF PATH-FOLLOWING METHODS FOR THE 
EIGENVALUE PROBLEM 



DIEGO ARMENTANO 

Abstract. In this paper we study path-following methods for the eigenvalue 
problem. We introduce a projective framework to analyze this problem. We 
define a condition number and a Newton's map appropriate for this context, 
proving a version of the 7-Theorem. Our main result bounds the complexity 
of path-following methods in terms of the length of the path in the condition 
metric. 



1. Introduction And Main Results 

1.1. Introduction. In this paper we study the complexity of path- following meth- 
ods to solve the eigenvalue problem: 

(A/„ -A)v = 0, 0, 

where A e K nx ™ (IK = M or C), v G I", Ael. Classical algorithms for solving the 
eigenvalue problem may be divided into two classes: QR methods (including Hes- 
senberg reduction, single or double shift strategy, deflation), and Krylov subspace 
methods; see Wilkinson [22] , Golub-van Loan [T3], Stewart [IS], Watkins [2"T] . 

Surprisingly, the complexity of the eigenvalue problem is still an open problem. 
It may be formulated in the following terms: given an algorithm designed to solve 
the eigenvalue problem, 

(1) For which class of matrices does it converge ? 

(2) What is the average number of steps, in a given probabilistic model on the 
set of inputs, to obtain a given accuracy on the output ? 

The two following examples show that such questions are particularly difficult: 

• QR algorithm with Wilkinson's single shift diverges for a non-empty open 
set of matrices (see Batterson-Smillie [IJ[2]). 

• QR algorithm is convergent for almost every complex matrix. However, 
even for the choice of Gaussian Orthogonal Ensamble, as a probabilistic 
model, question (2) remains unanswered (see Deift |10|). 

In this paper we consider the eigenvalue problem as a bilinear polynomial system 
of equations and we consider homotopy methods to solve it. The system (A/„ — 
A)v = 0, v 7^ 0, is the endpoint of a path of problems 

(A(t)I„ - A(t))v(t) = 0, v(t) £ 0, < t < 1, 

with (A(l), A(l),u(l)) = (A,X,v). Starting from a known triple (A(0), A(0), u(0)) 
we "follow" this path to reach the target system (A/„ — A)v = 0. The algorithmic 
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way to do so is to construct a finite number of triples 

{A k ,X k ,v k ), 0<k<K, 

with A k = A(tk), and = to < t\ < . . . < tic = 1, and where A&, v k are approx- 
imations of X(t k ), v(tk)- The complexity of this algorithm (defined more precisely 
below) is measured by the number K of steps sufficient to validate this approxima- 
tion. In this paper we relate K with a geometric invariant, namely, the condition 
length of the path. 

We begin with the geometric framework of our problem. Since the eigenvalue 
problem is homogeneous in v g K n and in (A, A) g K" x " x K, we define the solution 
variety as 

V =: {(A, A, v) eP(K" x " x K) xP(K") : (AJ n - A)v = 0} , 

where P(E) denotes the projective space associated with the vector space E. We 
speak interchangeably of a non zero vector and its corresponding class in the pro- 
jective space. 



1.2. A bihomogeneous Newton's method. Given a non-zero matrix A £ K nxn , 
we define the evaluation map F A ■ K x K n — > K n , by 

F A (X,v) := (XI n -A)v. 

Associated to F A we define N A :Kx (K n - {0}) — > K x K" , given by 

(1.1) N A (X,v) := (A,«) - ( j DF j4 (A,u)| Kx „x)- 1 F j4 (A,i;), 

defined for all (A, v) such that DF A (X, v)\ Kxv ± is surjective. Here v 1 is the Hermit- 
ian complement of v in K™. This map is homogeneous of degree 1 in ii, therefore, 
N A induces a map from K x P(K n ) into itself. 

We define the Newton map N on (K nxn - {0„}) x K x (K" - {0}) by 

N(A,X,v) := (A,N A (X,v)). 

This map TV is a bihomogeneous map of degree 1 in (A, A) and v. Hence N is 
well-defined on P(K" X ™ x K) x V(K n ) (see Section^. 

Given A e K" xn , A ^ 0„, and (Ao,Uo) G K x K", u 7^ 0, the Newton sequence 
associated to A is defined by 

(A,X k+ i,v k+1 ) :=N(A,X k ,v k ), k > 0. 

We say that this sequence converges immediately quadratically to a solution of the 
eigenvalue problem (^4, A, v) € V when 

d V 2 {{A, X k ,v k ), (A, A, «)) < • d p2 ((A, A , «o), (A, A, »)) , 

for all positive integer k. Here dp2(-,-) is the induced Riemannian distance on 
P(I" X " x K) x P(K n ) (see Section\tj$. In that case we say that (A, X ,v Q ) is an 
approximate solution of the eigenvalue problem (A, A, v) G V. 
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1.3. The predictor-corrector algorithm. Let T(t) — (A(t),X(t),v(i)), a <t < 
b, be a representative path in V. To approximate the path T by a finite sequence 
we use the following predictor-corrector strategy: given a mesh a = to < t± < . . . < 
t K = b and a triple (A(t ), A , v ) G K nxn x K x K", (v ^ 0), we define 

(A{t k+1 ),X k+ i,v k+1 ) := N(A(t k+1 ),X k ,v k ), Q<k<K-l, 

(in case the Newton map is defined). We say that the sequence (A(t k ), X k) v k ), 
< k < K, approximates the path T(t), a < t < b, when, for any k — 0, . . . ,K, 
(A(t k ), X k ,v k ) is an approximate solution of the eigentriple T(t k ) = (A(t k ), X(t k ), v(t k )) G 
V. In that case we define the complexity of the sequence by K. 

1.4. Condition of a triple and condition length. Let W C V be the set of 

well-posed problems, that is the set of triples (A, A, v) G V such that A is a simple 
eigenvalue (see Section^. In that case, for a fixed representative (A, A, v) G V, the 
operator ILji (XI n —A)\ v x is invertible, where II,^ denotes the orthogonal projection 
of K" onto v- 1 . The condition number of (A, A, v) is defined by 



(1.2) 



:{l,\\A\\ F ■ \\Tl v ±(XI n - A)^- 1 ^ , 



where \\-\\f and || • || are the Frobenius and operator norms in the space of matrices. 
We also let n(A, A, v) = oo when (A, A, v) e V - W; (see Section^. 

When T(t), a < t < b, is an absolutely continuous path in W, we define its 
condition-length as 



(1.3) 



where 



w ■= 



f(t) 



r(t) 



li(T(t)) dt, 



r(t) 

r(t) 

on V (see Section 12. ip 



is the norm of T(t) in the unitarily invariant Riemannian structure 



1.5. Main Results. The main theorem concerning the convergence of Newton's 
iteration is the following. 

Theorem 1. There is a universal constant uq > with the following property. 
Let (A,X,v), (A,X ,v ) G F(K nxn x K) x P(K"). If (A, A, v) G W and 

d P 2 ((A, Ao, wo), (A, A, u)) < "° — -, 

/i(A, A,u) 

then, (A,Xq,Vo) is an approximate solution of(A,X,v). 
(One may choose uq = 0.0739,). 

Theorem [7] is a version of the so called ^-theorem (see Blum et al. [3]), which 
gives the size of the basin of attraction of Newton's method. Different versions 
of the 7-theorem for the symmetric eigenvalue problem and for the generalized 
eigenvalue problem are given in Dedieu [7] and Dedieu-Shub [9] respectively. 

Theorem [7] is the main ingredient to prove complexity results for path- following 
methods. 

The proof of Theorem [JJ follows from a version of the 7-theorem for the map 
N A :Kx P(K") — > IK x P(K") which is interesting in itself (see Section^. 

Following these lines our main result is: 
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Theorem 2. There is a universal constant C > such that for any absolutely 
continuous path T in VV, there exists a sequence which approximates T, and such 
that the complexity of the sequence is 



(One may choose C = 120). 

The proof of Theorem fj| is given in Section \5\ 

1.6. Comments. In their seminal paper [T7|, Shub and Smale relate, in the context 
of polynomial system solving, the complexity K to three ingredients: the degree of 
the considered system, the length of the path T(t), and the condition number of 
the path. Precisely, they obtain the complexity 



where C is a universal constant, D is the degree of the system, £(T) is the length of 
r in the associated Riemannian structure, and (J,(T) — sup a<t<b [i (r(i)). Similar 
results for the generalized eigenvalue problem were obtained in Dedieu-Shub [5]. 

In Shub [16j the complexity K of path-following methods for the polynomial 
system solving problem is analyzed in terms of the condition length of the path. 

In the context of polynomial system solving, the eigenvalue problem may be 
considered as a quadratic system of equations. However, [T7] and [TB] do not apply 
since the eigenvalue problem as a quadratic system belongs to the subset of ill-posed 
problems of generic quadratic systems. (See Li [14J). Therefore, in order to analyze 
the complexity of the eigenvalue problem, a different framework is required. Here 
we consider the eigenvalue problem as a bilinear problem (see Subsection \2.2.1\ . 

The approach considered in this paper is greatly inspired by Shub |16| . 

Remark 1.1. Our next objective is to transfer these theoretical considerations into 
a practical algorithm. This issue will be considered in another paper. For the 
case of polynomial system solving see Beltran [3], Burgisser-Cucker [5], Dedieu- 
Malajovich-Shub [S]. 

Note: Throughout this paper we will work with K = C. However most definitions 
and results can be extended immediately to the case K = R. Whenever it is 
necessary we shall state the difference. 



In this section we define the canonical metric structures and study some basic 
topological and algebraic properties of the solution variety. 

2.1. Canonical Metric Structures. The space l n is equipped with the canonical 
Hermitian inner product (•,•). The space K nx " is equipped with the Frobenius 
Hermitian inner product 



where B* denotes the adjoint of B. 

In general, if E is a finite dimensional vector space over K with the Hermitian 
inner product (•, •), we can define an Hermitian structure over P(E) in the following 
way: for igE, 



K < CD^ 2 £(T)fi(T) 2 ), 



2. Riemannian Structures and the Solution Variety 



(A, B) F := trace (B*A), 



(w, w') 



x 
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for all w, w' in the Hermitian complement x 1 - of x in E, which is a natural represen- 
tative of the tangent space T X P(E). Let dr(x,y) be the angle between the vectors 
x and y. 

In this way, the space P(K" xn x IK) x P(K") inherits the Hermitian product 
structure 

(2-1) \UXv)\\Ia,x, v) = \\{AMIa,x) + MI 

for all (A, X,v) G (A, A) 1 - x tr 1 . 

We denote by d P 2 (•, •) the induced Riemannian distance on P(K nXTl x K) x P(K"). 

Throughout this paper we denote by the same symbol dp distances over P(K n ), 
P(K" X ") and P(K" X " x K). 

2.2. The Solution Variety V. Recall that the solution variety VcP(K" x "xK)x 
P(K") is given by the set of triples (A, X, v) such that (XI n -A)v = 0. Note that V is 
the set of equivalence classes of the set {F = 0}, where F : K" x ™ xKx (K n — {0}) — > 
K n is the multihomogenous system of polynomials given by 

(2.2) F(A, X, v) = (XI n ~ A)v. 

Therefore V is an algebraic subvariety of the product P(K nxn xl)x P(K"). More- 
over, since is a regular value of F, we conclude that V is also a smooth submanifold 
of P(K nx ™ x K) x P(K n ). Its dimension over K is given by 

dim V = dim(K" xn x K x K n ) - n - 2 = n 2 - 1. 

The tangent space T(a,\,v)V to V at (A, X, v) is the set of triples 

(A,\,v) G K nx ™ x K x K™, 

satisfying 

(2.3) (XI n - A)v + (XI n - A)v = 0; (A, A) F + XX = 0; (v,v)=0. 

Remark 2.1. The solution variety V inherits the Hermitian structure from P(K™ x,l x 
K) x P(K") defined in (|2~Tj) . 

We denote by 7Ti and 7T2 the restriction to V of the canoncial projections onto 
P(l" x " x K) and P(K n ) respectively 

Note that m(V) C P(K" xn x K) does not include the pair (0„, 1). Therefore we 
can define the map 

7T : V -> P(K nx "), 7r:=po7Ti, 
where p is the canonical projection 

(2.4) p:P(r x "xl)-{(0„,l)}4P(P x "), p(A,X)=A. 



V 




P(K nx ") 
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The derivative 

(2.5) Dtv(A, A, v) : T (A ^ v) V -► T A P(K" X "), 

is a linear operator between spaces of equal dimension. 

Definition 1. We say that the triple (A, A, v) G V is well-posed when Dir(A, X,v) 
is an isomorphism. Let W be the set of well-posed triples, and £' := V — W be the 
ill-posed variety. Let £ = vr(E') C P(K nXTl ) be the discriminant variety, i.e. the 
subset of ill-posed inputs. 

Lemma 2.1. £' is the set of triples (A, X,v) £ V such that A is no£ a simple 
eigenvalue. 

Proof. The linear operator ()2.5|) is given by 

Dtt(a, a, u)(i, a» = A + • A, (A, a, «) e r ( A,A J »)V. 

According to (|2.3[) . a non-trivial triple in the kernel of I?7r(^4,A,w) has the form 
(tjw- A, A, v), where (v, v) = 0, v ^ 0, and 

II^IIf 

A (l + -^^j v + (XI n - A)v = 0. 

Then, rank[(A/„ - A) 2 ] < n - 1. □ 
Corollary 1. £' is an algebraic subvariety ofV. 

Remark 2.2. When K = C, by the Main Therorem of elimination theory (cf. Mum- 
ford [TSl pp. 33]) and the fact that the projection p is Zariski-closed (cf. [TSJ 
Corollary 2.28]), we conclude from Corollary \T\ that S is an algebraic subvariety of 
P(K nx "). 

Remark 2.3. The solution variety V is connected since each (A, X,v) £ V can be 
connected by a path (in V) with a triple of the form (vv* , \\v\\ ,v) € V. Here v* is 
the conjugate transpose of v. 

Lemma 2.2. (i) When K — C,Wis connected. 
(ii) When K = M. W /ia,s £wo connected components. 

Proof, (i) Since V is connected, the result follows from Corollary [7] and the fact 
that a complex algebraic subvariety of V can not disconnect it (cf. Blum et al. [2J 
pp. 196]). 

(ii) It is enough to prove the lemma in the afhne case. Let V and W be the afhnc 
spaces associated to V and W. Let ip : V — > W l x M" xn be the continuous map 
given by ip(A, A, v) ~ (v, XI n — A). Define the subsets 

C := {(w, M) e R n x M" xn : Mw = 0} 

and 

B := {(w, M ) € £ : rank(Af + ww T ) = n}. 

Note that (p projects V onto £, and therefore C is connected. Moreover ip(W) = B. 
The second assertion follows from the fact that H W ±M\ W ± — H W ±(M + ww T )\ w ±, 
for all (w, M) £ C. 

Note that, for all (w,M) £ B, ip- 1 (w 1 M) = {(M + al n .a,w) : a £ R} is a one 
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dimensional subspace of VV. Therefore, the set B := {(M, 0, w) : (w, M) £ B} is a 
deformation retract of W. It is clear that B and B are homeomorphic. 

Then, the lemma follows from the fact that B has two connected component on 
C. □ 

2.2.1. Multidegree ofV. In this item we will see that the bilinear approach consid- 
ered in this paper gives the correct number of roots of the eigenvalue problem. 

For the sake of simplicity in the exposition, we will restrict ourself to the case 
IK = C. This subsection follows closely D'Andrea-Krick-Sombra [6]. 

Since V is an algebraic subvariety of the product space P(C nx ™ x C) x P(C n ), 
there is a natural algebraic invariant associated to V, namely, the multidegree of 
V. This invariant is given by the numbers deg ( -„2_ 1 _ i i ) (V), i = 0, . . . , n — 1, where 
deg(„2_ 1 _ i i)(V) is the number of points of intersection of V with the product A x 
A' C P(C nx ™ x C) x P(C"), where A C P(C nx ™ x C) and A' C P(C n ) are generic 
(n 2 — 1 — ?')-codimcnsion plane and z-codimension plane respectively (see Fulton 
USD- 
Lemma 2.3. deg (tl 2_ 1 _ i)i )(V) = (,"i) for i = 0, ...,n- 1. 

In order to give a proof of this lemma we recall some definitions from intersection 
theory (cf. Fulton [12]). (See also [6]). 

The Chow ring of P(C" xn x C) x P(C n ) is the graded ring 

A* (P(C" xn x C) x P(C n )) = Z[u l ,u 2 ]/(uf +l ,u^), 

where uj\ and L02 denotes the rational equivalence classes of the inverse images of 
hyperplanes of P(C" X ™ x C) and P(C"), under the projections P(C nx ™ x C) x 
P(C n ) -> P(C nx " x C) and P(C" xrl x C) x P(C n ) -> P(C") respectively. 

Given a codimension n algebraic subvariety X C P(C nx ™ x C) x P(C"), the class 
of X in the Chow ring is 

n-l 

[*] = '£deg in i_ 1 _ iii) (X)cj i + 1 u;r 1 - i G A* (P(C" X " x C) x P(C")) . 

Proof of Lemma \2.3[ Let F^, (i = 1, ...,n), be the coordinate functions of F de- 
fined in (|2.2j) . Since Fj is bilinear for each i, we have that the class of {Fi — 0} C 
P(C nx ™ x C) x P(C") is given by 

[{Fi = 0}] = ui + uj 2 £ A* (P(C nx ™ x C) x P(C n )) , (i = 1, . . . , n). 

Then, the class of V in the Chow ring is 

n 

[V] = [{F\ = 0} n • • • n {F n = 0}] = l[[{Fi = 0}], 

i=l 

where the last equality follows from Bezout identity. Therefore, one gets 
that is 



ft 



□ 



s 
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Proposition 2.1. For all A £ P(C nx ") - E we have #tt- 1 (A) = dcg ( „ 2 _ li0) (V) = 
n. 

Proof. Since P(C nx ™)— E is connected, the number of preimages under it is constant 
on it. From Lemma[tl\we get that the restriction n^v-v ■ V-E' -> P(C" xn x C) 
is a bijective map onto its image 7Ti(V — E'). Therefore, given A £ P(K" xn ) — E, 
we have ^fn^ 1 (A) = #p| 7ri (v) (A), where p is the projection map given in (|2.4|) . 
Moreover, from [15j Corollary 5.6], we get that #p\ ni (v)~ (A) = deg7Ti(V), where 
deg is the degree of the projective algebraic subvariety tti (V) C P(C ,lXTl x C) . Since 
dim7Ti(V) = dim(V) and the fact that 7ri|v-S' : V — E' —> 7Ti(V — E') is bijective, 
we get that #(A x P(C")) n V = #A n ^(V), for a generic (n 2 - l)-codimension 
plane A C P(C" X ™ x C). Then, we obtain that degTr^V) = deg (n2 _ 1)0 )(V). □ 

Remark 2.4. From this proposition we get that the map 7r|v-7r- 1 (s) : V — 7r _1 (E) — > 
P(C™ xn ) — E is a n-fold covering map. 

2.2.2. Unitary Invariance. Let U„(K) stand for the unitary group when K = C or 
the orthogonal group when K = R. The group U„(K) acts on P(K n ) in the natural 
way, and acts on K" x ™ by sending A i-> UAU^ 1 . Moreover if (A, X,v) £ V, then 
(UAU^ 1 , A, Uv) £ V. Thus, V is invariant under the product action U„(K) x V — > V 
given by 

(2.6) U- (A,X,v) ^ {UAXJ- 1 ,\,Uv), U £ U„(K). 

Remark 2.5. Note that the group U„(K) preserves the Hermitian structure on V, 
therefore U„(K) acts by isometries on V. 

3. Condition Numbers 

In this section we introduce the eigenvalue and eigenvector condition numbers, 
and we define the condition number for the eigenvalue problem. We will discuss the 
condition number theorem for this framework, which relates the condition number 
with the distance to ill-posedness. In the last part of this section we study the rate 
of change of condition numbers. 

3.1. Eigenvalue and Eigenvector Condition Numbers. When {A, A, v) be- 
longs to W, according to the implicit function theorem, 7r has an inverse defined 
in some neighborhood U A C P(K" xn ) of A such that tt" 1 ^) = (A, X,v). This 
map = tt~ 1 \u a '■ ^ V is called the solution map. It associates to any matrix 
B £ Ua the eigentriple (B, A_b, Vb) close to (A, A, v). Its derivative 

D5?(A, A, v) : T A P(K nxn ) -> T {A ^ v) V, 

is called the condition operator at (A, A, v). 

If (A, A, v) £ W, the derivative Df(A, A, v) associates to each B £ T A F(K nXn ) 
a triple (A, A, v) satisfying ()2.3|) . Moreover, equation (|2.3p defines two linear maps, 

DJ? x (A,\,v)B = (i,A) and DS%(A, A, v)B = v, 

namely, the condition operators of the eigenvalue and eigenvector respectively. 



Lemma 3.1. Let (A, A, v) £ W. Then for B £ T A P(K nxn ), one gets: 
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(i) 



DS? x (A,\,v)B = (B-X-^—A,X), where A = B '''' ! 



(ii) 

Dy v {A, X, v)B = IV (A/„ - A)\ v ± - 1 (ll v ± (Bv)^j , 

where u G K n is a left eigenvector of A with eigenvalue X: a non-zero vector 
satisfying (XI n — A)*u = 0. 

Proof, (i): Note that the relation between B e A 1 - and (A, A) £ (A, A)- 1 is given 

by 

(3 - 1} B = A+ im A 

Moreover, from (|2.3[) we get (Av,u) = X(v,u), i.e. 

(3.2) A=^f 

{v,u) 

From dHII]) and ([321) follows 

1 (Bv,u) 



X = 



1 + Mr («•«) ' 

I^IIf 

(ii): From ()2.3)) again one gets 

v = u v ±(\i n - A)^- 1 (n„x(i«)) . 

Since H„_l = (Au) by ([33)1 . the result follows. □ 

Since P(K nx ") is equipped with the canonical Hermitian structure induced by 
the Frobenius Hermitian product on K" x ™, the condition numbers of the eigenvalue 
and eigenvector are given by 

fj, x (A,X,v)= sup \\Dyx(A,X,v)B\\ (Aj x) 

bea 1 - 

\\B\\f = \\A\\ f 

fi v (A,X,v)= sup \\D.y v (A,X,v)B\\ v 

BeA 1 - 

\\B\\f = \\A\\ f 

Proposition 3.1. Let (A, A, v) € W. Then 
(i) 

1 



fix(A,X,v)= XJ 

1 + im 



, \\v\\ 2 -\\u\r 1/2 

\{v,u)\* _ 



(ii) 

lh,(A,\,v) = \\A\\ F • ||n^(A/ n - A)\ v ± % 
where \\ ■ \\ is the operator norm. 

Remark 3.1. H v ±(XI n — A)\ v ± is a linear map from the Hermitian complement of 
v in K. n into itself. Hence the operator norm of its inverse is independent of the 
representative of v. 
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Proof, (i): From Lemma \3A 



\DJ? x (A,\,v)B" 2 



(A,A) 



|A| 2 (l + ^) 



(3.3) 



\\M%+ \M 2 



\A\ 



{Bv,u) 






{v 7 u) 







ll^ + |A| 2 



Then, the proof of (i) can be deduced from the following result: 

\\A\\ F -J\\v\\ 2 -\\uf 



sup 

|B|lip = l|A|| 



[Bv,u) 



|A| 2 

WIS- 



\{v,uW 



(The proof is left to the reader) . 

(ii): Since Av = Xv, we have H v ±(Bv) = H V ±((B + aA)v), for any a € K and 
B € A- 1 . Then, from .Lemma HOI we get: 



sup 

I|b||f=||A|| f 



sup IX,;-L(A7 n - A)\ v ± '(Tl^iBv) 

I|b|If=i 



Since {n„i(Bu) : B € K" x ", \\B\\ F = 1} fill the ball of radius \\v\\ in u x , the result 
follows. □ 

Corollary 2. /i\ and fi v are invariant under the action o/U n (]K). 

Remark 3.2. Let (A, X,v) € W. If (A/„ - A)*v = 0, that is, if v is also a left 
eigenvector of A with eigenvalue A, then, 



l^\(A, A, v) 



V2 



1 



In particular, this is the case when A is normal, i.e. A* A = AA* . On the other 
hand, /i„ happens to be more interesting since, roughly speaking, it measures how 
close to A others eigenvalues are. 

Lemma 3.2. Let A e P(K" xn ) be a normal matrix. If (A,X,v) e W then 

»v(A,X,v)= . ll ^\ r 
muij | A — Xi \ 

where the minimum is taken for Xi eigenvalue of A different from X. 

Proof. Since A is normal, by the unitary invariance of /z„, we may assume that 
A is the diagonal matrix Diag(A, A2, . . . , A n ), where A, Xi are the eigenvalues of A. 
Moreover, since (A, A, v) <E W, A ^ Xi for i = 2, . . . n. Then, the result follows from 
Proposition \3.1\ □ 



COMPLEXITY OF EIGENVALUE PROBLEM 



11 



3.2. Condition Number Revisited. The condition number of a computational 
problem is usually defined as the operator norm of the map giving the first order 
variation of the output in terms of the first order variation of the input. In our 
case the condition number should be the operator norm of the condition operator 
DS p (A,X,v) given in Section^ i.e. 

\\Dy(A,\v)\\:= sup \\D^(A,X,v)B\\(A,x,v)- 

bea 1 - 

\\B\\f = \\A\\ f 

Note that this quantity is bounded below by fj, v (A, A, v) and above by (/za(^4, A, v) 2 + 
fi v (A, A, v) 2 ) 1 / 2 . However, in spite of this definition, we define the condition number 
of the eigenvalue problem in the following way. 

Definition 2 (Condition Number) . The condition number of the eigenvalue prob- 
lem is defined by 

(3.4) fi(A,\,v) := xaax{l, fi v (A, X,v)}. 

In the next proposition we show that this definition and the usual one are essen- 
tially equivalent. 

Proposition 3.2. Let (A, A, v) € W. Then 

-j=-ti(A,X,v) < \\DJ?(A,X,v)\\ <2-n{A,X,v) 

The proof follows from the next lemma. 

Lemma 3.3. Let (A, A, v) G W. Then, 

(i) Hv{A,X,v) > I/n/2; 
(ii) 

Ma (A, A, v) < ^js- • (2 + fx v (A, A, vf) 1 ' 2 . 

Proof. Fix a representative of (A,X,v) e W such that |jw|| = f. 
(i): One has, 

\\U v ^(XI n -A)\^\\ < \\U V ^(A)\ V ^\ + \X\<V2\\A\\ F , 
that is, ||II„x(AI„ - A)|„x|| < y/2. Therefore, 

I = || (n„x (A7„ - A)\ v ±)~~ 1 U v ± (XI n - A)\ v ± || < V2fx v (A, X, v). 



(ii): Since the action of U n (K) on P(K n ) is transitive, we may assume that v is 

the first element of the canonical basis. Then A has the form ( ^ I , where 

V° A 

e K lx(n-1) and I e K (n-l)x(n-l)_ Xhcn A _ XI n = (° n j ™ t ). Note 



v A-XI n -i 

that u = (I,— (A — XL n ^i)~*w*) T is solution of (^4 — XL n )*u = 0, i.e. u is a 
left eigenvector. Here, - T and •* denotes the transpose and conjugate transpose 
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respectively. Then 

NHMI 



> 



> 



l + IKA-A/n-x) W *|| 2 

1 

l + IKi-AJ^!)- 1 !! 2 -!^!! 2 
1 



The result now follows from Provosition \3. 1\ □ 

The next subsection is included for the sake of completeness but is not needed 
for the proof of our main results. 

3.3. Condition Number Theorems. In this subsection we study the relation of 
H\(A, A, v), fJ. v (A, A, v) and A, v) with the distance of (A, A, v) to E'. The main 
result in this subsection is that n(A, A, v) is bounded above by sin(dp2 (A, A, v), E')) _1 . 

Let (E, (•, •)) be a finite dimensional Hermitian vector space over K. Given A a 
projective subset in P(E), we denote by A C E its affine extension. 

Lemma 3.4. Given i e E, i / 0, we have 

sm(df(x, A)) = — — - — , 

IMI 

where rfg is the distance generated by (•, •). 

Proof. The proof is straightforward. □ 

The next proposition is a version, adapted to this context, of known results given 
by Wilkinson and Shub-Smale [Tg] . 
Recall that E = tt(E') c ¥(K nxn ). 

Proposition 3.3. Let (A, A, v) £ W. Then 
(i) 



(j,\(A,\,v) < ^ 

(ii) 

fJh(A, A, v) = 



sin(d P (A,E)) 2 
\\A\\f 



d F {A,T, v + \I n ) 
where S„ = {Be P(K" X ") : (B,0,v) € E'} C E. 

Proof, (i) Let £ C K nx ™ be the affine extension of E in K" x ™, and let u be a left 
eigenvector associated to A with eigenvalue A. Wilkinson shows that: 



Ml • IMI <V 2, PI 



F 



IMI " dp(A, E) : 
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(cf. Demmel [IT], Wilkinson [23]). Then, (i) follows from Proposition \3.1\ and 
Lemma \3.4\ 

(ii) In Shub-Smale 18 it is proved that, for a fixed triple (A, A, v) E V, 
d F (XI n — A, £„) = =— . 

lin^CA/n-^U" 1 !! 

Then, (ii) follows from Lemma [3~7\ □ 
Corollary 3. For (A, A, v) e W, we get 

fi(A,X,v)< 



sm(dv(A, £)) ' 

L 

iu«(AA,») < 



Proof. Since £„ + a/ n C £ for all a € K, we conclude from Lemma \3~4\ that : 

1 



sin(d P (A, £)) ' 

Moveover, since the second member is greater than one, the proof follows. □ 
Proposition 3.4. For (A, A, v) £ W, we get 

n(A,\,v)< ' 



sin(d P 2 ((A, A, «),£')) 

Proof. Let £'„' := {(£>, r/) € P(K nXTl x K) : (B,r),v) G £'}, and £'„' its afhne 
extension in K nxn x K. Note that 

4»»xk((A A), £") = d F (A - AJ„, E„), 
where £„ is defined in Provosition \3.3\ Then, from Provosition I ff. 31 we get 

4«"xk((A A),£") = — 7T~i — \- 
Since Trf 1 ^') C £', we get 

d r2 ((a, a, v), e') < d p2 ((^, a, wj^rHs")) = <M(A a), e;o). 

Then, the result follows from the fact that sin(-) < 1. □ 



3.4. Condition Number Sensitivity. For the proof of Theorem [H we have to 
study the rate of change of the condition number /i defined in (|3.4[) . 
The main result of this subsection is the following. 

Proposition 3.5. Given e > 0, there exist C e > such that, if (A, X, v), (A' , A', v') 
belongs to W and 

d r 2((A,X,v),(A',X',v')) < Cg - 

then 

fj,(A,X,v) 



l + e 

arctan 



< M (A',A'y) < (l+e)n(A,X,v). 



(One may choose C e = V2 ( f +^° (1+£>; , where a := (1 + v^V^. 
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Before proving Proposition \3.5\ we need some additional notation. 
When E is a finite dimensional vector space over K equipped with the Hermitian 
inner product (■,■), we define 

(3.5) dr(w, w') := tan(dp(w, w')), 

for all io, w' € P(E). We have 

d T (w,w') = \\w - w'\\ w , 

whenever w and w' satisfy (w — w' ,w) = 0. 

Note that dp(-, •) < dr(-, ■)■ Moreover, we have: 

Lemma 3.5. Let w, w' S P(E) such that d P (w,w') < 8 < tt/2. Then 

d r (w,w') < d T {w,w') < t&n }^ -d r (w,w'), for allw, w' € P(E). 



Proof. This follows from elementary facts. 

Given w € K n , w ^ 0, we define for any B £ K nxn the map 
fl w ±E> : K n -¥ K n , by fi w ±B := r o U W ±B, 
where t : w — > K" is the inclusion map. That is, 



□ 



IL„_lBz = Bz- (Bz, 



w w 
\w\\ \\w\\ 



Since (fl v ± (AJ„ - A)) v = for all (A, A, v) e W, then wc have 



H v (A,X,v) = \\A\\ F ■ U v ±(XI n — A)\ v 



i\x(A7 n -A) 



where f is the Moore-Penrose inverse. 

Lemma 3.6. Let v, w G P(K") and B E K nxn . Then 



IX,,j__D — H,, n ±B 



< 2\\B\\ ■ d T (v,w). 



Proof. Take representatives of v and w such that \\v\\ = 1 and (v — w,v) — 0. Let 
mGK", then 



< 



- (Bu, v)v -(Bu- (Bu, 

IMI IMI 



\\w\\ \\w\\ 



(BU, -rr—rr ~ v) jr—rr + (Bu,V) — -V 

ir \\w\\ \\w\ 



< 2\\Bu\\ 



w 



< 2\\Bu\\ ■ d T {v,w). 



a 



Let d T 2 be the product function defined over P(K nXTl x K) x P(K") by 
d T , ((A, X, v), (A',X',v')) := (d T ((A, A), [A 1 , A')) 2 + d T (v, v ') 2 ) 1/2 
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Proposition 3.6. Let a := (1 + \/E)2y/2. 

Let (A, A, v), (A\ A', v') e W such that 

d r2 ((AA )W ),(A',A',«')) < 

a- fJ, v {A,X,v) 

Then, the following inequality holds: 

(1 + V2d T 2 ((A, A, v), (A' , A' , «')) ) • fi v (A, A, v) 



IXv{A , A , v ) < 



1 - a ■ fx v {A, A, u) • d T 2 ((A, A, u), (A 1 , A', u')) 



Proof. Consider representatives of (A,X,v) and (A',A',i/) such that: \\A\\p = 
\\v\\ = 1, (A, A) - (A',X') perpendicular to (A, A) in K nx ™ x K, and v - v' per- 
pendicular to v in K™. 

Notation: for short, let A x := (XL n - A) and A A , := (X'I n - A'). 
By Wedin's Theorem (see Stewart-Sun [20], Theorem 3.9) we have 



A; 



U vl ^A' y 



l + \/5 



Since 



< 

tl v ±A x 

t 



(n v ,±A' x 



IL V ±A X -Il v ,±A' x , 



< 



t 



n v ±A x ) - (ii„,±A 



, then, 







^A 


)' 




1 1 + v 7 ^ 
x 2 






U v ±A x -fl v ,±A' x , 



Note that 



< 



n„±A\ -Tl„,,xA> 



n v ,±A x ~n vl ±A' x , 



lA - lV iA A 

< 2.||A A ||-dr(t;,t/) + ||A A -^,||, 

where the second inequality follows from Lemma \3. 61 Moreover, taking into account 
that (A, X,v) £ W and the choice of elected representatives, we get 

\\A X -A' X ,\\ < + |A- A'| 

< V2-d T ((A,X),(A',X'))-y!\\A\\l, + \X\2 

< 2-d T ((A,X),(A',X% 

and hence 

\tl.^A x - TL V ,±A X , J < 4 • d T (v, v') + 2- d T ((A, A), (A', A')). 
Then we conclude 



n v ^A' x , 



< 





(n^A A ) f 




1- (l + v / 5)2 v / 2- 




\ t 

„^a ) 


■d T 2((A,X,v),(A',X',v')) 



1(5 
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The proposition follows from the following fact: < 1 + ||A — A'\\f < 1 + 

V2d T ((A,X),(A',X')). " ' □ 

Proposition 3.7. Givene > 0, there existc e > suchthat, if(A,X,v), (A',X',v') € 
W and 

d T *((A,X,v),(A',X',v'))< \ 

then, 

fi(A',X',v') < (l + e) f i(A,X,v). 
(One may choose c £ = —t= — |=-— — -, where a = (1 + v5)2\/2.) 

y 2v 2+v 2a{l-\-e) 

Proof. It is enough to prove the assertion for fj, v instead of \x. 
Recall from Lemma HOI that fx v is bounded below by 1/V2. Hence, 

d T *((A,X,v),(A',X',v')) < 

implies 

d T ,((A,X,v),(A',X',v'))< 

fj, v {A,X,v) 

From Proposition \3.6\ if c is such that \[2c < l/a and 

l + 2y/2c 



1 - V2c 

we get the result. 
One may choose c E = 



< 1 +£, 



2 v / 2 + v / 2a(l + e)' 



□ 



Corollary 4. Given e > 0, there exist c' e > such that, if (A, X, v), (A 1 , A', i/) € W 
and 



d p2 ((AA,«),(A',A', w ')) < 



A*(A A, u) 

li{A',X',i/) < (l + s)fi(A,X,v). 
( One may choose c' = arctan ( — m — ] where a := (1 + ^/b)2^/2. ) 

1 e \ 2V2+V 2a(l+e) / 

Proof. By Lemma \3.4\ if 

d p2 ((A,A,«),(A',AV)) < / 

then 

d T2 ((AA,«),(A',A',«')) < ^•^((AV),(^,A>')) 

^ tan(c') 



proving the lemma. □ 
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Proof of Proposition \S '. 51 From Corollary^ there exist c' > such that, if (A, A, v), 
(i',A',»')eW are such that 

d p2 ((A,\v),(A',X'y)) ■ fi{A,X,v) < J, 

then 

»(A',X',v') < (1 + e)/x( J 4,A,w). 
It is enough to take d such that c' < arctan ( — ; ) . In this case we have 

" \ 2V2+V2a(l+£) J 

d p2 ((A, A, v), (A>, A', v')) ■ n{A',\', v') < c'(l + e). 

Then, by the same argument, if c'(l + e) < arctan ( 2 V2+vL»(i+e) ) we nave * ne 
other inequality. □ 

4. Newton's Method 

In this section we start describing the Newton method defined in the Introduc- 
tion. The main goal of this section is to prove Theorem [7] 

4.1. Introduction. Let us recall the definition of the Newton map on P(K" xn x 
K) x P(K"). We define 

N(A,X,v) := (A,N A (X,v)), 

where Na is the Newton map, given in (jl.ip . associated to the evaluation map 
Fa(\,v) = (AJ n — A)v, for a fixed (non-zero) matrix A and (A,u) € IK x IK™. 
Note that Na has the simple matrix expression 



N A 



X\ fX\ fv XI n -A\ 1 fX 



vl \ v I \ 



To compute the Newton map we have to solve for (A, v) S IK x K n the following 
linear system: 

Aw + (XI n - A)v = (XI n - A)v, 
(v,v) = 0. 

Then one gets: 

Lemma 4.1. IfTl v ±(XI n — A)\ v ± is invertible, then the Newton iteration is given 
by 

N(A,X,v) = (A,X — X,v — v), 

where 

v = (lv(aj„ -^)|„x) _1 n w x(A/ n - A)v, 

A _ ((XI n -A)(v-v),v) 
(v,v) 

From Lemma \4-l\ we conclude that N is a well-defined map on the product 
space P(K" X ™ x Kj x P(K n ). Moreover, for a fixed matrix A € K" xn , A ^ n , we 
conclude also that the map Na is well-defined on K x 
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4.2. 7-Theorem. In order to prove Theorem [I\ and TTieorem [H we need to obtain 
a version of the 7-Theorem for the Newton map N A : K x P(K n ) ->Kx P(K n ). 

Proposition 4.1. Let < u < l/(2v / 2). 

7e£ (A, A, u) € W sucft to ||A|| F = 1, and let (Ao, no) € K x P(K"). 
// 

flAo-Af+dpfo,*;) 2 ) 1 /^ " , 

M(A,A,w) 

t/ien, the Newton sequence (Xk,Vk) ■— 7Vj[(Ao,vo) satisfies 

(\X k - A| 2 + d r (v k ,v)r /2 < (f^) • I})' _1 ■ (|Ao - A| 2 + d P (, ,.) 2 ) 1/2 , 
/or a// k > 0. 

This proposition will be the main tool to prove Theorem Q] and also Theorem 
It is a version -for the Newton map Na- of a fairly known theorem in the literature, 
namely, the 7-Theorem, which gives the size of the basin of attraction of Newton's 
method. In our case, for the Newton map N A reads: 

Theorem 3. There is a universal constant Co > with the following property. 
Let (A, A, v) € W such that \\A\\ F = I, and (A ,v ) e K x P(K"). 7/ 

(|A -A| 2 + d P (tt y 2 ) 1/2 < C ° 

MA A,u) 

iften, i/ie sequence (Afe,Ufe) = 7V^(Ao,uo) converges immediately quadratically to 
(A, u) wii/i respect to the canonical distance in IK x P(K n ). 
fOne may choose cq = 0.288,). 

Since we do not find an appropriate place to refer to this version, we include a 
proof of the Proposition \4-l\ in the Appendix. Note that the proof of Theorem 
follows directly from Proposition \4-l\ picking u such that: < u < 1/(2V2) and 
2tan(tt)/(l - V2u) < 1. 

4.3. Proof of Theorem [TJ 

4.3.1. Preliminaries. 

Lemma 4.2. Fix a representative of (A, A, v) £ V such that \\A\\f = 1 and \\v\\ = 1. 
Lei (A>')eKxP(K n ). 

(1) If |A- A'| < c < v/2, ^en, 

d p2 ((A, A, «), (A, A>')) < P c ■ (|A - A'| 2 + v ') 2 ) 1/2 , 

where /3 C = (1 - c 2 /2)- 1 / 2 . 

(2) 7/<i P 2((A,A,«),(AA / ,i/)) <6><7r/4, tften, 

(|A - A'| 2 + dr(«, «') 2 ) 1/2 < Re ■ dp* ((A A.«), (A, \',v')), 
where Rg = [^2/ cos(6> + tt/4) 3 ] 1 / 2 . 
The proof of Lemma \4-'A 1S included in the Appendix. 

Let #0 such that Rb 6q = l/(2\/2), where i?e is given in Lemma \4-£\ (do ~ 
0.1389). 
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Proposition 4.2. Let < u < 6 - 

Let (A,X,v), (A,Xo,v ) £ P(K™ X ™ x K) x P(K"). If (A, A, v) e W and 

dpi((A,\,v),{A,\a,va)) < . — r, 

fj,[A,X,v) 

then 

d P 2(N k (A,X ,v ),(A,X,v)) < 
for all k > 0, where S(u) := u/(l — u). 

Proof. With out loss of generality we may assume \\A\\p — 1 and ||v|| = 1. 
By Lemma \4--2\ ~we get 

(4.1) (|A„-AP + 

Since u < 6q, we have uR u < l/(2y/2), and then from Proposition \4-l\ and Propo- 



sition UhM we get 

(4.2) (|A fe - X\ 2 + d P (v k ,v) 2 )^ 2 < 

forallfc> 0, where (Afc, Vk) ■— N\(Xq, vq) . Moreover, since (|A -A| 2 +rfp(w , v) 2 ) 1 / 2 < 
uR Ul we deduce from Lemma \4-£\ that 

d P *{N k (A,X ,v ),(A,X,v)) < 

2tan(wi? u ) \ /l x 



<RuPuR u I j— J ■ U) ■d r 2((A,X ,v ),(A,X,v)). 

(Note that u < 9 < f .) □ 
4.3.2. Proof of Theorem^ 

Proof of Theorem{l\ From Proposition ^ -S\ proof of T7ieorem[7]follows picking uq > 
such that uo < and Ru fiu a R u ( ^ ta "i"° < l Q ne ma y choose uo = 

0.0739. □ 



5. Proof of the Main Theorem 

5.1. Complexity Bound. In the introduction we defined the condition length of 
an absolutely continuous path r : [a, b] — > W as 

W= I \\f(t)\\ m »(T(t))dt. 



The next proposition is useful for our Main Theorem [H 



20 



COMPLEXITY OF EIGENVALUE PROBLEM 



Proposition 5.1. Given e > 0, C e > as in Proposition \3.5\ and T : [a,b] — > W 
an absolutely continuous path with ifj,(T) < oo, define the real sequence {s k } k =o....,K 
in [a, b] such that: 

• so = a ! 

•Sk such that fj,(T(s k -i)) ||r(i)|| r (t)<fe = C E , 

whenever n(T(s k -i)) f gk i ||f (t)\\ r (t)dt > C £ ; 

• else define s k = sk = b. 

Then, 

1 + e 

a 



Jf<-^-^(T) + l. 



Proof. Given s £ [a fc _i, s fe ], d P 2 (r(s fc _i), r(s)) < ||f (t)\\ m dt < Mr^-i))" 1 C e 
By the first inequality in Proposition Iff. 51 we get 

»t( t )\\ m »(T(t))dt> r iif(t)ii r(4) di=-^-, 



whenever k < K. Since ^(r) < oo, if < oo, and adding, yields 

a 



^(r)>(ir-i) T 



□ 



5.2. Proof of the Main Theorem [H 



Proof of Theorem^ Let A(t), a < t < b, be a representative path of the projection 
7r(T) c P(K" X ") such that \\A(t)\\ F = 1 for a < t < 6. 

Given a mesh a = to < ii < • • • < £k = b, for fc = 0, 1, . . . , K , let T(t k ) = 
(A(t k ),X k ,v k ) e K" x ™ x K x P(K"), be the sequence described in Section{TE 
Then, if T(t k ), f (t k ), T(t k+1 ) are such that, 

d P 2(r(t k ),r(t k+1 )) < —£ and d P ,(T(t k ),f(t k )) < ( 



then, 

dp2(r(t fc+1 ), (A(i fc+1 ), Afc,w fc )) < 

< d P 2(r(i fe+1 ),r(i fe )) + 4o(T(t fc ),f (t fc )) + 

dp2(r(tfe), (A(t k+ i),X k ,v k )) 

2C 

- 7F77 w +dr2(T(t k ),{A(t k+1 ),X k ,v k )). 

Note that 

d V 2 (f (t k ), (A(t k+ i),X k ,v k )) = dv((A(t k ),\ k ), (A(t k+ i),X k )). 
Since = 1, a < i < b, then, abusing notation, we get 

d r ((A(t k ),X k ),(A(t k+l ),X k )) < d P {A(t k ),A(t k+1 )), 
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where the inequality follows from a direct application of the law of cosines. More- 
over, 

rtk+i 

d P (A{t k ),A(t k+1 )) < / \\A{t)\\ A[t) dt 
Jt k 

tk + l 

tfe 
tk+l 



< V2 \\D,y x {T{t))A(t)\\ (A(t)Mt)) dt 

< V2 [ k+1 \\t(t)\\ m dt, 



where the second inequality follows from the trivial lower bound which one may 
obtain from Q3.3p . 

Assuming J f tfc+1 ||f (i)|| r(t) dt < C e /n(T(t k )) we conclude 

d V 2(Y(t k+ i), (A(t k+1 ), X k ,v k j) < ~ 1 ' ~ 



m(T(<0) ' 

Since dp2 (T(t k ), T(t k+ i)) < jjr^h^jj , from Provosition \3.5\ we get 

(l + e)(2 + v / 2)C £ 



.(T(tin-i),(i4(tfc+i),A fc ,t;fc)) < 



M(r(t fc+ i)) 

From Proposition\4J% if u := (1 + e)C £ {2 + y/2) < 6 , then 

d, 2 (N(A(t k+1 ),X k ,v k )),T(t k+1 )) < 

< Rup uRu )^-) - ■ dp2(T(t k+1 ),(A(t k+1 ),X k ,v k )) 

\1 - V2uR u J I 



R 

< 



a ( 2tan(Mfl u ) \ 1 



M(r(tfc+i)) 

Then, if e is small enough such that u < 9q and R u fiuR u \ \ u < Ce, 

we get that r(ifc + i) is an approximate solution of the eigenvalue problem T(t k +i). 
Then, the proof of Theorem [H can be deduced applying Proposition I5.il to the e 
selected before. □ 

Remark 5.1. One can take e = 0.2448. Then, C e « 0.010383, and one can choose 
C = 120. 



6. Appendix 

This section is divided in two parts. In the first one we include a proof of 
Proposition \4-l\ In the second part we prove Lemma {4~!>\ 

6.1. Proof of Proposition I4TT1 Throughout this subsection, when ever we fix a 
representative of (A,X,v) £ W such that \\A\\f = 1 and ||u|| = 1, we will consider 
the canonical Hermitian structure on K x K". 
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6.1.1. Preliminaries and Technical Lemmas. 

Lemma 6.1. Let {A, A, v) £ W and ?/ e P(K n ) sucft i/iai d P («, i>') < tt/2. 

Let : V /_L — > u 1 - 6e i/ie restriction of the orthogonal projection IL V ± of 



\ n onto u /_L . TTien, 



^ur 1 " ! 



cos(dp(v, v')) ' 



2) Pick a representative of (A, X,v) £ W suc/i i/iot ||j4||f = 1 awd ||u|| = 1- Then, 
(i) 

|| (^F A (A, V )| Kx „^) _1 -i?F A (A, w )| Kx ^|| = 



cos(dp(w, u')) ' 
(ii) 



(DfU(A )W )| Kx „,x) '||< 



cos(dp(v, v')) 



Remark 6.1. In part 2) and 3) of the preceding lemma, we consider the spaces 
IK x v 1 - and K x u /J ~ as subspaces of K x K n with the canonical Hermitian structure. 

Proof. 1): Follows by elementary computations. 

2)- (i): For (A, »)eKx ir 1 , let (?), w) £ K x v' 1 " such that 

(t7>) - (^F A (A,«)| Kx „^) _1 -^F A (A,«)| Kx ^(A,«). 

Then, 

fjv + (XI n - A)w = Xv + (XI n - A)v. 
Since (A, A, v) £ W, we deduce that f\ = X and H v ±w = v. Then, we conclude that 

(DFa(X,v)\ Kxv ,±)~ 1 ■ DFa(X,v)\ Kxv ±(X,v) = (A, (n„j.| l) ,j_) _1 ('i))). 

Taking norms, and maximizing on the unit sphere in IK x v 1 , (i) follows from 1). 
2)-(ii): Note that 

|| (DF A (X,v)\ Kxv ,±)~ 1 || < 

|| (DFa(X, v)\ Kxv ,±)~ 1 ■ DFa(X, v)\ Kxv ± I • || (^(A^kx^r 1 II, 

then apply 2)-(i). □ 

Lemma 6.2. Let (A,X,v) £ K nxn x K x IK" such that \\A\\ F = 1 and \\v\\ = I, 
then ||L> 2 F A (A,w)|| < 1. 

Proof. Differentiating two times Fa, we get 

D 2 F A (X,v)(X,w)(i),u) = Xii + rjw, for all (A, w), (f), u) £ K x K". 

Then, 

\\D 2 F A {X,v){X,w)(r,,u)\\ < \X\ ■ \\u\\ + \t]\ ■ \\w\\ 

< (|A| 2 + |H| 2 ) 1/2 -(N 2 + lkf) 1/2 , 
where the second inequality follows from Cauchy-Schwarz. □ 

We recall the fairly known Neumann Lemma (or Banach Lemma): 
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Lemma 6.3 (Neumann Lemma). Let E be a Hermitian space, and A, Ie : E — > E 
be linear operators where Ie is the identity. If \\A — < 1, then A is invertible 
and 

1 



\A~ l \\ < 



1- WA-Ik 



Proposition 6.1. Let < u < 1/(2^/2). 

Let (A, A, v) G W, such that \\A\\ F = 1, \\v\\ = 1, and (A , v ) G K x P(K n ). If 
(IXo-X^ + dT^v) 2 ) 1 / 2 < 



DF A (A, v) 



then the Newton sequence (\k,Vk) '■= N a (Xq,Vo) satisfies 

(\X k - A| 2 + d T (v k , v) 2 ) 1 ' 2 < V2 ■ 6(V2u) • ■ (|A - A| 2 + d T (v , v) 2 ) 1 / 2 , 

for all k > 0, where 5(u) :— u/(l — u). 

Proof. Take a representative of vq such that (v — vq,vo) — 0. Note that ||i>o|| • 
dr(v, v ) = \\v - v \\ and ||w || < 1. 

In particular, the hypothesis implies that 

\\DF A (X, v)\ Kxv ±~ 1 \\ ■ 1 1 ( A - A,w - w)|| < u. 
Taylor's expansion of Fa and DFa in a neighborhood of (A, v) are given by 

(6.1) F A {\',v') = DF A (X,v)(X' - X,v' - v) + X - ■ D 2 F A (X,v)(X' - X,v' - v) 2 , 



and 
(6.2) 
One has 



DF A (X', v') = DF A (X, v) + D 2 F A {\, v){\' - X,v' - v). 



(r>F A (X,v)\ 



DF A (X ,v ) 



DF A (X,v)\ 



DF A (X ,v )\ 



-DF A (X,v)\ 



= (DF A (X,v)\ Kxvo± ) 1 ■D 2 Fa(X 1 v))(X -X 1 v -v)\ 
Then, taking norms, we get 

(dF a {X,v)\ Kxvo± ^ ■ DF A (X ,v )\ Kxvo± - I Kxvq x 



< 



< 
< 



DF A (X,v)\ K> 
1 

cos (d r (v, «o)) 



\D 2 F A (X,v))(X - X,v -v)\ 



DF A (X,v) 



|| (A -X,v - v)\\, 



where the last inequality follows from Lemma \6.1\ and Lemma[ 
In the range of angles under consideration ||w || = cos(dp(v, vq)) > I/a/2- Then, by 
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the condition < u < l/(2\/2) we can deduce from Lemma ROl that DFa{Xo: v o)\ Kxvo ± 
is invertible and 



(6.3) 



(dF a (\ ,v )\ KxvoX ) X -DF A (\v) 



< 



< 



1 



cos(d r (v,v )) 

Moreover, 

N A {Xo,v )-{X,v) 

= (A - X,v - v) - ^DF A (X 0l v ) 

Ikxd j 



DF A (X,v)\ Kxv ± • || (A - X,v -v)\\ 



Kx»o J 



F A (X ,v ) 



DF A (X ,v )\ 

■ ^DF A (Xo,v )\ Kxvg± (X -X,v -v)- F A (Xo,v )^j 

Then, from (16.1[) we get 

N A (X ,v ) - (X,v) = 

= l - ■ (dF a {X ,v )\ k ^ vo± ) 1 ■ D 2 F A (X,v)(Xo -X,v - vf 
Taking the canonical norm in IK x K™, we get 
||^a(A ,vo)-(A,u)|| < 

< i- |^F a (A ,«o)Ikx,o^ 1 || •||^ 2 ^(A, w )(Ao-A, Uo -w) 2 ||. 
Then, from (|6.3p and Lemma \6. 1\ we get 
(6.4) ||iV A (Ao,«o)-(A,i;)||< 

< 



V2- 


DF A (X,v)\ Kxv ± 1 


■±-\\D 2 F A (X,v)(X a ~X,v -v) 2 \\. 


l-y/2- 


DF A (X,v)\ Kxv ± 1 


■ || (A - A,u - v)\\ 



Therefore, from Lemma 1 6. H yields 
ll^(A ,«o)-(A,«)|| < 



< 



V2- 


DF A (X,v)\ Kxv ± 1 


■ || (A - A,u - u)| 


1-V2- 


DF A (X,v)\ Kxv ± 


i 


• || (A - A,« - u)|| 



|| (A - A,u - 



Then, 

||JVa(A 0) «o)-(A ) «)|| < 



< 



V2- 


DF A (X,v)\ Kxv ± 1 




(|Ao- 


A| 2 + 


d T {v^vyy/ 2 


l-y/2- 


DF A (X,v)\ Kxv ± 


l 


• (|A 


-A| 2 


+ d T (v ,v)iy/2 



•i(|A -A| 2 + d T K^) 2 ) 1/2 . 

Let (Xi,vi) := N A (X ,v ). 
From the proof of Lemma W^ we have D 2 F A (X, v)(Xq — A, w — w) 2 = 2(A — X)(v — v), 
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then, from (|6.4[) one can deduce that — v\\ < 6(y/2u)\\vo — v\\, where S(u) = 
u/(l — u). Since u < l/(2y/2), we have 6(y/2u) < 1, then from Lemma 2, (4) of 
Blum et al. [I] (page 264) we get 



d T (vi,v)< l ^j^<V2-\\v 1 -v\\. 



Hence 

(6.5) QXi-Xf + M^v) 2 ) 1 / 2 < 



< 



2 • 


DF A (X,v)\ Kxv ± 1 


•(|A -A| 2 + d T («o,«) 2 ) 1/2 


1 - V2 • 


DF A (\,v)\ Kxv ± 


-l 


.(\\ -\\* + d T (v ,v) 2 )V 2 



■l(\\ a -\\ 2 + d T (v ,v) 2 ) 1/2 . 



Therefore, 



(6.6) (|Ax - A| 2 + dr(vi,v) 2 ) 1/2 <V2-5(V2u) ■ \ (|A - A| 2 + d T (v 0l v) 2 ) 1 ' 2 . 
From (|6.6[) , (|6.5p . and the fact that 6(y/2u) < 1, working by induction we get 

(\X k - A| 2 + d T (v k , v) 2 ) 1 ' 2 < V2 ■ S(V2u) ■ • (|A - A| 2 + d T (v , v) 2 ) 1 / 2 , 

for all k > 0, where (Xk,Vk) '■— N^(X 0l v Q ) . 



□ 



Proposition 6.2. Let (A,X,v) £ W, such that \\A\\p = 1 and \\v\\ = 1. Then, 
n(A,\,v) < \\DFa (A, v) Ikxd^^ 1 ! <2-fx(A,X,v), 

Proof. Since the action of U„(K) on P(K n ) is transitive, we may assume that v 
(1,0,..., 0) T . Then, completing to a basis ofIxw 1 ,we have that 



A = 



X w 
A 



DF A (X,v)\ 



Kxv A 



1 — W 

o n v ±(xi n - A)\ v , 



where weK lx '" 1) . 
Note that (DFa(X,v)\ Kxv i 



1 w(H v ± (XI n — A)\ v ± )~~ 1 
(U v ^(XI n - A)]^)- 1 



Hence 



\\DF A (A, v) \ Kxv ± '\\ > max{l, ||OV (XL n - ^4) |t>^ )~ 1 II } = KA A, v). 
On the other hand, 

\\DF A (X,v) | K x^ _1 || < 



< 



1 wiU^iXIn-A)]^)- 1 






o (a^ixin-A)^)- 1 

< max{i, \\w(n v ^(xi n - A)\ v ±)~ 1 \\} + ||(n^(A/ n - ^U)- 1 !! 

< 2-»(A,X,v). 



□ 



2(5 
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6.1.2. Proof of Proposition 



Proof of Proposition 
tion\6.2\SLiid Lemma 
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The proof follows directly from Proposition I6.il Proposi- 

□ 



6.2. Proof of Lemma 14.21 

Lemma 6.4. Let A E K nxn , A ^ 0„, such that \\A\\ F = 1. Let A, A' 6 K such that 
|A|<1. 

(1) If | A' — A| < c for some < c < y/2, then, there exists (3 C > 1 such that 

d r ((A,X),(A,X))<p c -\X'-X\. 

One may choose (3 C = (1 — c 2 /2)~ 1 / 2 . 

(2) If d r ((A,X),(A,X')) < 9 for some < 9 < tt/4, then, there exist R g > 1 
sitc/i that 

\X' -X\<R r d P ((A,X),(A,X')). 
One may choose Rg = \\f2j cos(6> + 7r/4) 3 ] 1/2 . 
Proof. Let := dp((A, A), (A, A')). By the law of cosines we know that 

|A - Af = 1 + |A| 2 + 1 + |A'| 2 - 2 ■ y^T+W ■ Vl+W? ■ cosO- 

Then, 

(6.7) |A-A'| 2 = ( V TTlAF-V^T|A 7 F) a + 



+2 • y/l + |A| 2 ■ ^1 + |A| 2 • (1 - cos 0). 

From (I6.7[) we get that 

|A - A'| 2 > 2 ■ ^1 + |A| 2 • VT+W ■ (1 ~ costf), 

i.e. 

(6.8) - - ^ ^' A, - A ' 2 

Therefore , 1 — cos 6* < ^- , and hence the angle 9 is bounded above byarccos(l — c 2 /2). 
By the Taylor expansion of cosine near we get the bound 

* 2 < T -^.(l-cose). 

Then, from (|6.8|) we can deduce the upper bound in (1). 

For the lower bound in (2), we rewrite the cosine law and we get: 

, |2 ( |A| 2 -|A'| 2 V , 



|A-A'| 



+2v/l + |A| 2 - v/l + |A'| 2 -(l-cos« 
Since ||A| - |A'|| < |A - A'| and 1 - cos (9 < 6> 2 /2, then, 



sni I l A l + l A 'l V u W|2 



(6.9) |A- A'| 2 < — 1 1 1 J -|A-A'| 2 + 



\'|2 . fl2 
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Since < |A| < 1, is easy to see that 



|A| + |A'| 1 + |A'| 



Moreover, by elementary arguments one can see that |A'| < tan(0 + 7r/4), and 
therefore one can get 

|A| + |A'| < 1 + tan(0 + tt/4) 



^T+W+V^+W 72+^1 + ^ + 77/4)2 

tan(0 + tt/4) 
Jl + tan((9 + 7r/4) 



tan(0 + 7r/4) 
< V 7 7 =sin(6> + 7r/4). 



Then, from (|d3]) . 



and hence 



i A _ A /ia< yrw-v/nw .2 



A - A < 



cos(6< + tt/4) 



V2 



2 



cos(<9 + rr/4) 3 



□ 



Remark 6.2. Note that if (A, A) e 7Ti(V) C P(K nx ™ x K) then |A| < ||A|| F is always 
satisfied. 

6.2.1. Proof of Lemma 



Proof of Lemma \J7i£\ The proof of (1) and (2) follows directly from de definition of 
dp2 and Lemma \6.4\ O 
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